Distributed Search on Large NoSQL Databases

Authors

  • Fernando G. Tinetti
  • Francisco Paez
  • Luis I. Aita
  • Demian Barry
Abstract

This work focuses on the performance and scalability of different policies for solving queries on large NoSQL databases with clusters. Distributing data and queries is among the main problems, given the distributed nature of a cluster, which is basically a set of networked computers. The basic centralized model (for both data and processing) is used as a starting point, and different distributed configurations are evaluated experimentally in order to derive guidelines for performance improvement. Apache Solr is used as the database management and search server. The current contents of the Spanish-language Wikipedia (about 4.5 GB) are used as an example NoSQL database for experimentation.
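
The abstract does not say how the distributed configurations were set up. As a minimal sketch only, one way to move from a centralized Solr server to a distributed one is Solr's `shards` request parameter, which asks the receiving node to fan the query out to the listed cores and merge the results. The host names, the core name `eswiki`, the field name `text`, and the helper `build_distributed_query` are all hypothetical and not taken from the paper. The code only builds the request URL and does not contact any server.

```python
from urllib.parse import urlencode


def build_distributed_query(coordinator, shard_cores, query, rows=10):
    """Build a Solr /select URL that fans one query out to several shards.

    coordinator -- "host:port/path" of the node that receives the request
                   (hypothetical address, not from the paper)
    shard_cores -- list of "host:port/path" entries, one per shard
    query       -- Solr query string, e.g. "text:universidad"
    """
    params = {
        "q": query,
        "rows": rows,
        "wt": "json",
        # Solr's `shards` parameter is a comma-separated list of cores,
        # written without the "http://" scheme.
        "shards": ",".join(shard_cores),
    }
    return f"http://{coordinator}/select?{urlencode(params)}"


# Assumed two-node cluster, each node holding part of the index.
shards = ["node1:8983/solr/eswiki", "node2:8983/solr/eswiki"]
url = build_distributed_query("node1:8983/solr/eswiki", shards, "text:universidad")
print(url)
```

Passing a single-entry shard list gives the centralized baseline through the same code path, which makes it straightforward to compare the two configurations under the same query load.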

Similar resources

Comparative Study of Column Oriented NoSQL Databases on Characteristics

A NoSQL database, also called Not Only SQL, is an approach to data management and database design that is useful for very large sets of distributed data. The growing popularity of big data will compel many companies to use NoSQL databases instead of traditional databases. Generally, there are three main types of NoSQL databases: key-value stores, column-oriented databases and document-based stores....

Data Migration: Relational RDBMS to Non-relational NoSQL

As part of achieving specific targets, business decision making involves processing and analyzing large volumes of data, which leads to enterprise databases growing day by day. Considering the size and complexity of the databases used in today’s enterprises, it is a major challenge for enterprises to re-engineer their applications so that they can handle large amounts of data. Compared to traditiona...

Persisting big-data: The NoSQL landscape

The growing popularity of massively accessed Web applications that store and analyze large amounts of data, with Facebook, Twitter and Google Search being prominent examples, has posed new requirements that greatly challenge traditional RDBMS. In response to this reality, a new way of creating and manipulating data stores, known as NoSQL databases, has arisen. This paper r...

NoSQL Data Modeling Techniques

NoSQL databases are often compared by various non-functional criteria, such as scalability, performance, and consistency. This aspect of NoSQL is well-studied both in practice and theory because specific non-functional properties are often the main justification for NoSQL usage and fundamental results on distributed systems like the CAP theorem apply well to NoSQL systems. At the same time, NoS...

Comparisons Between MongoDB and MS-SQL Databases on the TWC Website

Owing to the huge amount of website data to be analysed, innovative web services are required to support it with high scalability and availability. The main reasons for using NoSQL databases are to handle this huge amount of data and to express large-scale distributed computations using Map-Reduce techniques. To enhance the service quality of customers and solve the problems of the huge...

Publication date: 2011